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We propose a model of network growth that generalizes the deactivation model previously sug- 
gested for complex networks. Several topological features of this generalized model, such as the 
degree distribution and clustering coefficient, have been investigated analytically and by simula- 
tions. A scaling behavior of clustering coefficient C ~ 1/M is theoretically obtained, where M 
refers to the number of active nodes in the network. We discuss the relationship between the 
recently observed numerical behavior of clustering coefficient in the coauthor and paper citation 
networks and our theoretical result. It shows that both of them are induced by deactivation mech- 
anism. By introducing a perturbation, the generated network undergoes a transition from large- 
to small- world, meanwhile the scaling behavior of C is conserved. It indicates that C ~ 1/M is a 
universal scaling behavior induced by deactivation mechanism. 

PACS numbers: 89.75.Hc, 87.23.Ge, 89.65.-s, 89.75.Fb 



I. INTRODUCTION 

Many social, biological, and communication systems 
can be properly described as complex networks with 
nodes representing individuals or organizations and links 
mimicking the interactions among them[l-3]. Examples 
are numerous: these include the Internet [4,5], the World 
Wide Web [6, 7], biological networks [8, 9], food webs [10], 
social networks [11], etc. Recent empirical studies indi- 
cate that the networks in various fields exhibit some com- 
mon topological characteristics: a small average distance 
as random networks, large clustering coefficient as regular 
networks (small- world property) [12] and a power-law de- 
gree distribution (scale- free property) [13]. The ubiquity 
of complex networks has inspired tremendous investiga- 
tions on them. Among these flourishing researches, the 
effect of aging is of particular interest [14-18], since it is a 
universal mechanism in reality. For instance, in the movie 
actor collaboration network, the more famous an actor is, 
the more chances he will have to act in new movies. But, 
no matter how famous he may be, every star will become 
gradually inactive as time goes on. This aging effect can 
greatly influence the evolution of networks and results in 
peculiar network structural property[14,15]. 

Recently, Borner et al. introduced a general process 
model that simultaneously grows coauthor and paper ci- 
tation networks [19], in which the core assumption is that 
the twin networks of scientific researchers and academic 
articles mutually support one another. In their model, 
each of the authors and papers is assigned a topic, and 
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authors read, cite, produce papers or coauthor with oth- 
ers only in their own topic area. Interestingly, they found 
that the clustering coefficient C of the simulated paper 
citation network is linearly correlated with the number 
of topics. We note that the main underlying dynamic 
rule governing the evolution of the network is aging. For 
example, due to the lifespan of human, once authors are 
older than a specified age, they will be set deactivated, 
and do not produce papers or coauthor with others any 
longer. Furthermore, papers cease to receive links when 
their contents are outdated. Therefore, these consider- 
ations motivate us to theoretically investigate the effect 
of aging on the clustering coefficient of the network. In 
the present paper, we concentrate on this ingredient of 
self-organization of the coauthor and paper citation net- 
works and propose a simple generalized model, in which 
the main dynamic is deactivation mechanism. We will 
demonstrate that the behavior of clustering coefficient C 
in the coauthor and paper citation networks is universal 
in networks generated by deactivation mechanism. 



This paper is organized as follows. In section II, the 
model is introduced. In section III, we give both the 
numerical and analytic results about the effect of deacti- 
vation mechanism on network structure, including degree 
distribution (Sec. Ill A) and clustering coefficient (Sec. 
Ill B). An interesting scaling behavior of C is obtained. 
In section IV, a structural perturbation is introduced. 
We show that the perturbation leads to a structural tran- 
sition from large- to small- world (Sec. IV A), while the 
scaling behavior of C is conserved (Sec. IV B). Finally in 
section V, we discuss the relationship between our result 
and the behavior of clustering coefficient in the coauthor 
and paper citation networks and give a summary . 
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II. THE MODEL 



In the present model, each node can be in two differ- 
ent states: active or inactive[18, 20,21]. The evolution 
process starts with a one-dimensional lattice consisting 
of M active nodes with periodic boundary condition and 
coordination number 2z[22]. Then, in each time step 

(1) Add a new node into the network, and connect it 
to m nodes randomly chosen from the M active ones. 

(2) Activate the new node. 

(3) Deactivate one of the active nodes. The probability 
that the node i is deactivated is given by 



(l) 



where the normalization factor is defined as a = 
(52jeA V^') -1 - The summation runs over the set A of 
the currently M + 1 active nodes. 

It is worthwhile to note that, when M = m and 
z = [4f], the present model reduces to the famous de- 
activation model introduced by Klemm and Eguiluz (KE 
model) [18]. For convenience, we call this generalized de- 
activation model GKE model. 
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FIG. 1: Illustration of the normalization factor a as a function 
of time t with the parameters m — 10 and M = 30. The 
amplified version can be seen in the inset. The data points 
correspond to system size N = 2 x 10 4 , and each is obtained 
as an average of 100 independent runs. 



III. STRUCTURAL PROPERTIES 

A. Degree distribution 

By using the continuous approximation similar to that 
used in Ref. [18], the degree distribution P(k) can be 
obtained analytically for GKE model. Let us first derive 
the degree distribution (k) of the active nodes at time 
t. It evolves according to the following master equation: 



P 



(t+i) 



(fc + 1) 



TT(fc)] 



+ pW(fc + l)[l-_][l- 7r (fc + l)].(2) 

On the right side of Eq. (2), the first term accounts for 
the process in which an active node with degree k at time 
t is connected to the new node and not deactivated in the 
next time step; The second term indicates the process 
that an active node with degree k + 1 at time t is not 
connected to the new node and still active in the next 
time step. 

We investigate the behavior of a in time evolution. 
Fig. 1 shows the dependence of the normalization fac- 
tor a on time t. We find that a approaches a stable 
value with certain fluctuations as soon as the evolution 
of the network starts. We assume that the fluctuations 
of the normalization factor a are small enough, i. e., it 
can be treated as a constant. Then, the stationary case 
p(t+i)(fc) = p (*)(jfe) of Eq. (2) yields 



p(k + l)-p(k) = 



-a — (7 — l)k 
k(k + 7 — a) 



P(k), 



(3) 



where 7 = ^M. 7 \ Treating k as continuous we write 
down the equation 



dp —a — (7 — l)k 
dk k(k + 7 — a) 

which yields the solution 

p(k) - fc- 7+1 . 



P(k), 



(4) 



(5) 



When the system size N is large compared with M, the 
degree distribution of the whole network P(k) can be ap- 
proximated by considering the inactive nodes only. Thus 
P(k) can be calculated as the rate of the change of the 
degree distribution p(k) of the active nodes. We find 



P(k) 



dp 
"dk 



(6) 



where c = (7 — l)m 7_1 is the normalization constant. 
Finally, the exponent 7 = 3 is obtained from a self- 
consistent condition 



poo 

2m = kP(k)dk. 

J m 



(7) 



The exponent 7 can be tunable if we introduce the 
initial attractiveness just like that of the model in Ref. 
[18]. Since it is not our focus, we will not show this effect 
here. 

In Fig. 2, we plot the cumulative degree distribution 
of GKE networks by simulations. We obtain a power 
law scaling with best-fitted exponent 7 — 1 = 1.96 ± 
0.02, which is in agreement with the analytical result. In 
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FIG. 2: (Color online) Cumulative degree distribution of GKE 
networks with parameters (a) M — 20; m — 4 (squares), 
8(upward triangles), 14 (downward triangles), 20 (circles) and 
(b) m = 20; M = 20 (squares), 40(upward triangles), 80 
(downward triangles), 100 (circles). The data points corre- 
spond to system size N — 2 x 10 4 , and each is obtained as 
an average of 100 independent runs. The two dash lines have 
slope —2.0 for comparison. 



fact, the exponent 7 is dependent on m[20], which can be 
ignored when m is large. However, the number of active 
nodes M has no effect on degree distribution exponent 
7, which is analytically and numerically obtained. 



B. Clustering coefficient 

The clustering coefficient C(l) of node I with degree fa 
can be defined as follow: 



C(l) 



2E(l) 
ki(k - 1)' 



(8) 



where E(l) is the number of links between neighbors of 
node I. 

According to the definition of the GKE model, when 
a new node with m links is added into the network, the 
links are attached to the nodes randomly selected from 
the active ones. Thus, the probability that two arbitrary 



active nodes are connected is 
/ with degree ki = m has 



M ' 



E{1) = 



m ki(ki 



M 



It follows that a node 



(9) 



If I is deactivated in the time step of its generation its 
neighborhood does not change any more and C'(l) keeps 
stable. Otherwise, node I is not deactivated. In the next 
time step, a new node j is added. As we note, the prob- 
ability that node j makes connection to I is equal to the 
probability that one of the neighbors of node I is deac- 
tivated in the last time step. We assume that if fa is 
added by 1, one of its active neighbors has already been 
deactivated in the last time step. Thus, when the newly 
added node is connected to node I, one of its neighbors s 
is inactive and one possible link between the newly added 
node and s is missed. Then we have 



E{1) 



m r fajki - 1) 
M 2 



-1], 



(10) 



where fa = m + 1. Also, if fa — m + 2, there will be 2 
inactive nodes in the neighbors of node I causing another 
2 possible links to be missed. Thus we obtain 



m ,fa{fa — 1) 

E(l) = — - 1 - 2 , 

M 2 J ' 



(11) 



where fa — m + 2. This process repeats until node I 
is deactivated, whose neighborhood does not change any 
more. By induction, we have 



w M v 2 



1) 



h\ —m 

E • 



(12) 



Thus the clustering coefficient C(l) depends only on the 
degree fa . The exact relation is 



m + l)(fa — to), 
fa(fa-l) J 



(13) 



The clustering coefficient C of the whole network is the 
average of C(l) over all nodes, i. e., 



1 N 

c = ±y™[i- 

N ^ M l 

1=1 



(fa — TO + l)(fa — TO.) 



ki(fa - 1) 

Writing Eq. (14) in continuous form yields 
(k -to + l)(fc - 



(14) 



C 



M 



k(k-l) 



P(k)dk, (15) 



where P(k) is the degree distribution which we have de- 
rived above. Finally, the result is 



M L 6 30 v n 



(16) 



Obviously, when M = to, the clustering coefficient of the 
KE model is recovered [23]. 
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FIG. 3: Illustration of the average clustering coefficient C as 
a function of system size N with the parameter m = 4 and 
M — 10. The clustering coefficient C approaches a stationary 
value about 0.31, which is precisely predicted by Eq. (16). 
Each data point is obtained as an average of 1000 independent 
runs. 



FIG. 4: (Color online) The analytical result of the clustering 
coefficient of GKE network, C(M), as a function of M (line), 
in comparison with the simulation (circles) results. Other 
parameters for the simulation are m = 10 and N = 2 x 10 4 . 
Each data point is obtained as an average of 100 independent 
runs. 



From Eq.(16), we know that the clustering coefficient 
C is independent of the system size N. This asymptotic 
behavior of C is reported in Fig. 3. In the limit of large 
N, the clustering coefficient C gets to an stationary value 
of 0.31, which agrees with the analytical result. 

It is important to point out that the clustering coef- 
ficient has an novel scaling behavior C ~ 1/M. Exten- 
sive numerical simulations perfectly confirm this result 
(see Fig. 4). This behavior can be related to the re- 
cent numerical study on the coauthor and paper citation 
networks, which will be discussed in Sec. V. 



IV. STRUCTURAL PERTURBATION 
A. Structural transition 

We introduce a structural perturbation to the GKE 
model by modifying step (1) of the definition as follow: 
Add a new node with m links to the network. With 
probability p, attach one of the new node's links to a 
randomly selected inactive node. The other links are then 
attached to nodes chosen randomly from the M active 
ones. We will show that the perturbation will lead to 
a phase transition [24] from large- to small- world in the 
network without changing the scale-free property. 

In GKE model, each node can be represented by the 
time step of its generation. It is clear that, when p = 
the GKE network is structured[18], i. e., the time or- 
dering exists and the mean field manner is absent[20,25]. 
We denote l(t) as the average distance for pairs of nodes 



separated by time interval t. Fig. 5 shows the simulation 
results of the variation of l(t) with perturbation param- 
eter p. It can be found that, when p = 0, l(t) increases 
linearly with t, i. e., the time ordering indeed exists. 
Since the nodes in the network are uniformly distributed 
on time axis, we can easily obtain that the average dis- 
tance L is linearly correlated to the system size N, i. e., 
L oc N, which indicates the absence of small world ef- 
fect. However, once p is a small finite value, l(t) becomes 
independent of time interval t, i. e., the time ordering 
vanishes. Meanwhile, all nodes with the same degree 
can be considered to be statistically equivalent, and the 
mean-field manner is recovered. 

Let d(i,j) denotes the distance between node i and 
node j, and thus the average distance of the model with 
system size N is 



L(N) = 



2a(N) 
N(N - 1) ' 



where the total distance is 



a(N)= Yl d &j)- 

\<i<j<N 



(17) 



(18) 



Intuitively, when a new node is added, the distance be- 
tween old nodes will not increase. Hence wc have 



N 



a{N + 1) < a(N) + ^ d(i, N + l), 



(19) 
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FIG. 5: (Color online) Illustration of l(t) as a function of 
time interval t, with perturbation p = 0.00 (solid line), 0.01 
(dashed line), 0.05 (dotted line), and 0.10 (dot-dashed line). 
Other parameters for the simulations are m = 3, M = 10, 
and N = 8000. Each data point is obtained as an average of 
50 independent runs. 



thus 



N 



a(N + 1) < a(N) + V d(i, x) + N, 



i=l 



(20) 



where x is the active node connected to the newly added 
one. Since p is nonzero, by using mean-field approxima- 
tion[26,27], we have 



Y,d(i,x)nL(N)(N-l). 

Thus, the inequality (20) reduces to 

a(N + 1) < a(N) + + N. 

Rewriting (22) in continuous form will yield 
da(N) 2a(N) 



dN 



< 



N 



which leads to 



a(N) <N 2 lnN + B, 



(21) 



(22) 



(23) 



(24) 



where B is a constant. As a(N) ~ N 2 L(N) and N is 
sufficiently large, we obtain L{N) < IniV, i. e., the in- 
creasing tendency of L(N) is not faster than In N, which 
predicts the presence of small-world property. 

In fact, the GKE network is similar to a chain of dense 
clusters locally connected, i. e., it is like a regular lat- 
tice in topological view. For this peculiar topology, all 



FIG. 6: (Color online) Illustration of the average distance L 
as a function of N, with p — 0.00 (squares) and p = 0.01 
(circles). When a perturbation p — 0.01 is introduced, L 
grows logarithmically with N. The values can be fitted well by 
a straight line, which is typical of the small world effect. Other 
parameters for these simulations are m = 4 and M — 10. 
Each data point is obtained as an average of 100 independent 



of the links in the network are local. When a pertur- 
bation is introduced, the network undergoes a cross-over 
from structured network to unstructured network. Actu- 
ally, the perturbation just means that, with a probability, 
every node rewires one of its local links to a randomly se- 
lected node, which is precisely the definition of the model 
proposed by Watts and Strogatz[12]. That is to say, the 
cross-over is just the small- world phase transition[24]. 

In Fig. 6, we show the dependence of average dis- 
tance L on system size TV with p — 0.00 and p = 0.01 
in GKE network. For p = 0.00, the average distance 
grows linearly L oc N, the same behavior observed in 
one-dimensional regular lattices. Once p is a small fi- 
nite value, L becomes logarithmic related to N, i. e., 
L oc In AT. The logarithmic increase of average distance 
with system size predicts that the phase transition from 
large- to small-world occurs, which is in agreement with 
the analytical result. 



It should be noted that, although we introduce a struc- 
tural perturbation into the network, the scale-free prop- 
erty is not affected and the power-law exponent 7 = 3 
is maintained. Numerical simulations shown in Fig. 7 
confirm this feature. 
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FIG. 7: (Color online) Illustration of the cumulative degree 
distribution of the GKE network with perturbation p = 0.00 
(squares), 0.01 (upward triangles), and 0.10 (circles). The 
fitted power-law exponent is 7 — 1 = 1.97 ± 0.02. Other pa- 
rameters for these simulations are m = 10, M = 20, and 
N — 2 x 10 4 . Each data point is obtained as an average of 
100 independent runs. The dashed line has slope —2.0 for 
comparison. 



FIG. 8: (Color online) Illustration of the clustering coefficient 
C as a function of number of active nodes M, with pertur- 
bation parameter p = 0.00 (squares), p = 0.01 (upward tri- 
angles), p — 0.10 (downward triangles), p — 0.50 (diamonds), 
and p = 1.00 (circles). The average fit slope for the simu- 
lations is 0.994. Other parameters for these simulations are 
m — 10 and N — 2 x 10 4 . Each data point is obtained as an 
average of 100 independent runs. The dashed line has slope 
— 1.0 for comparison. 



B. Universal scaling behavior of clustering 
coefficient 

In the following subsection, we investigate the depen- 
dence of clustering coefficient C on perturbation param- 
eter p. Analogous to the derivation of clustering coeffi- 
cient in GKE network without perturbation, we give an 
approximately analytical result. According to the mod- 
ification of the model, when a new node / with to links 
is added into the network, one of the links is attached to 
a randomly selected inactive node s with probability p. 
That is to say, with probability p, one of the neighbors of 
I is inactive. Since the system size N is large compared 
with M, we assume that node s is apart from the active 
nodes [28]. Thus, to — 1 possible links between neighbors 
of I are missed. Furthermore, node s is always apart from 
the afterward added nodes that are connected to node I, 
which causes another k — m possible links missed. Thus 
we have 

_ m-lxp (ki - to + l)(ki - to) 
~ M [1 kdk^l) ] 

to — 1 x p 2(m — 1) 
" P M h(h-l) 

TO— lXp2(fc;— TO) 

- p m k l{kl -iy (25) 



Similar to the derivation of Eq. (16), we have 

It is worthwhile to note that the scaling behavior 
C ~ 1/M is conserved though there exist certain fluc- 
tuations in the network which lead to a structure tran- 
sition. That is to say, C ~ 1/M is a universal scaling 
behavior of clustering coefficient induced by deactivation 
mechanism. Fig. 8 shows the log-log plot of the clus- 
tering coefficient C versus M with different perturbation 
parameters obtained by simulations. We can see that the 
perturbation has almost no effect on the scaling behavior 
of C, which agrees well with the analytical result. 

From Eq. (26), we know that, when p is sufficiently 
small, the clustering coefficient C has a linearly relation 
with p. Fig. 9 shows the simulation result of cluster- 
ing coefficient C as a function of perturbation parameter 
p, with m = 4 and M = 10. The slope found numer- 
ically is 0.223, slightly larger than the analytical result 
t7(tt ^ ^tt— ) = 0.21. The deviation is due to the ap- 

M K 6 30m ' " 

proximation[28] used in the theoretical derivation of C. 
It is clear that the node s is not always apart from all 
the active nodes, which actually causes less than to — 1 
possible links missed between neighbors of I. Thus, we 
can easily find the precise slope should be a little larger 
than that obtained from Eq. (26). 
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FIG. 9: (Color online) Illustration of the clustering coefficient 
C as a function of perturbation parameter p. The fit slope 
is 0.223. Other parameters for this simulation are m — 4, 
M = 10, and N — 2 x 10 4 . Each data point is obtained as an 
average of 100 independent runs. 

V. DISCUSSIONS AND CONCLUSIONS 

We first discuss the relationship between the numeri- 
cal behavior of clustering coefficient in the coauthor and 
paper citation networks and our theoretical result. Ac- 
cording to the model of Ref. [19], each of the authors 
and papers is assigned a topic, and authors can only 
cite, produce papers or coauthor with others in their own 
topic area. It means that, by topics the whole network is 
divided into many subnetworks which evolve separately 
and simultaneously. Each of the subnetworks can be re- 
duced to a GKE network and the number of these GKE 
subnetworks is just the number of the topics denoted as 
n. In each subnetwork, the number of active authors who 
are doing research or the number of active papers that 
are likely to be cited just corresponds to the number of 
active nodes in GKE network, which is denoted as M. 
Since the whole network is divided into n subnetworks, 
we intuitively know that the number of active authors or 
papers in each subnetwork is inversely proportional to the 
number of topics, i. e., M ~ 1/n. For each subnetwork 
can be treated as a GKE network, incorporating with 
our theoretical result C ~ 1/M, we can easily obtained 
that C ~ n. Since each subnetwork evolves parallelly, the 
clustering coefficient of the whole network has the same 
behavior that it is linearly correlated with the number 
of topics. Therefore, by using our theoretical result we 
can indicate that the numerical behavior of clustering 
coefficient in the coauthor and paper citation network 
presented in Ref. [19] is due to the deactivation mech- 
anism. Furthermore, in the above discussion, we reduce 



the aging mechanism to deactivation mechanism. In fact, 
in the model of Ref. [19], the aging effect is introduced 
by an aging function. To this point, we conjecture that 
there might be similar scaling behaviors of C in networks 
generated by other forms of aging mechanism. 

Finally, it is worthwhile to point out that, to our knowl- 
edge, no empirical data are available to illustrate the the- 
oretical scaling behavior of clustering coefficient. Never- 
theless, this interesting property is due to the deactiva- 
tion process which is a special case of aging effect. In net- 
work evolution, aging is a universal mechanism. There- 
fore, this simple theoretical result of C will have a rich 
practical significance and potential applications in future 
network research. Meanwhile, such scaling behavior of 
C should be given further considerations from empirical 
investigations. 

In summary, motivated by the aging effect governing 
the evolution of the coauthor and paper citation net- 
works, a generalized deactivation model of network called 
GKE is presented in this paper. We study analytically 
and by simulations several topological features of this 
model, such as the degree distribution and clustering co- 
efficient. Most importantly, an interesting scaling behav- 
ior of the clustering coefficient C ~ 1/M is obtained, 
which shows that the numerical result recently observed 
in the coauthor and paper citation networks is due to 
deactivation mechanism. By introducing a perturbation, 
the GKE network undergoes a small-world phase tran- 
sition, while the scaling behavior of C is conserved. It 
indicates that C ~ 1 /M is a universal scaling behavior of 
clustering coefficient induced by deactivation mechanism. 
In addition, we would like to emphasize that our study 
unifies the concept of regular lattice, small-world graphs 
and scale-free networks in a single model, and the GKE 
model generalizes the new class of the networks with a 
crucial parameter M. 

Since the GKE networks present peculiar structure 
property, it will be interesting to investigate the ef- 
fect of their complex topology features on the network 
dynamics[25, 29-31]. Especially, the clustering coefficient 
of GKE network is precisely tunable by parameter M or 
p without changing the degree distribution. Therefore, 
the model can be used to quantitatively study the ef- 
fect of clustering on network synchronization[32-34] and 
network epidemics [3 1,3 5]. Research along this line is in 
progress. 
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